Vocal tract area function inversion by linear regression of cepstrum
نویسندگان
چکیده
Vocal tract data from 3D cine-MRI are used together with synchronised acoustics to evaluate a linear regression model for inversion. The first two principal components of vocalic area functions are predicted with correlations 0.99 and 0.97 respectively, from 24 FFT-cepstra measured in the frequency band 0-4 kHz. This best regression model together with the two component representation yields mean absolute errors of 0.37 cm in section area and 0.15 cm in vocal tract length.
منابع مشابه
Estimation of vocal-tract shape from speech spectrum and speech resynthesis based on a generative model
Precise control of articulatory parameters is difficult and prevents a physical model from generating natural sounding speech signals. To determine vocal-tract shape from speech, this paper presents an inversion method for simultaneously estimating the cross-sectional area and length of the vocal tract. In addition, we performed speech resynthesis from a time-series of estimated vocal-tract sha...
متن کاملContinuous Voice Morphing Using Separated Vocal Tract Area Functions and Glottal Source Waves
This paper presents a flexible voice morphing method, which is based on a conversion using a linear combination of the vocal tract area functions estimated from speech signals. The method focuses on the continuity of the phonological identity of the overall interpolated area. The main features of the method are 1) to separate characteristics of the vocal tract resonances from those of glottal s...
متن کاملRecovering vocal tract shapes from MFCC parameters
Recovering vocal tract shapes from the speech signal is a well known inversion problem of transformation from the articulatory system to speech acoustics. Most of the studies on this problem in the past have been focused on vowels. There have not been general methods e ective for recovering the vocal tract shapes from the speech signal for all classes of speech sounds. In this paper we describe...
متن کاملVocal tract inversion by cepstral analysis-by-synthesis using chain matrices
Acoustic-to-articulatory inversion for vowels is performed by cepstral analysis-by-synthesis, using chain-matrix calculation of vocal tract (VT) acoustics and the Maeda articulatory model. The derivative of the VT chain matrix with respect to the area function was calculated in a novel efficient manner, and used in the BFGS quasi-Newton method for optimizing a distance measure between input and...
متن کاملVocal tract normalization as linear transformation of MFCC
We have shown previously that vocal tract normalization (VTN) results in a linear transformation in the cepstral domain. In this paper we show that Mel-frequency warping can equally well be integrated into the framework of VTN as linear transformation on the cepstrum. We show examples of transformation matrices to obtain VTN warped Mel-frequency cepstral coefficients (VTN-MFCC) as linear transf...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005